Micro-blog Keyword Extraction Method Based on Graph Model and Semantic Space

نویسندگان

  • Hua Zhao
  • Qingtian Zeng
چکیده

There have been many domain-specific keyword extraction researches, but micro-blogoriented keyword extraction is just beginning. This paper researches into the keyword extraction from Chinese micro-blog. Taking the characteristics of micro-blog into account, such as short, topic divergence, etc., we propose a Chinese micro-blog keyword extraction method based on the combination of multi features. Firstly create the graph model based on the co-occurrence between words, get a kind of weight based on the created graph model. The weight based on the graph model is sometimes same. In order to solve this problem, this method secondly proposes to create the semantic space based on the topic detection method, and get the statistical weight based on the semantic space. Finally, we take the location of words into account during the extraction, which is proved to be a very effective feature. Experimental results show that the proposed keyword extraction method is very successful.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Generating an Indoor space routing graph using semantic-geometric method

The development of indoor Location-Based Services faces various challenges that one of which is the method of generating indoor routing graph. Due to the weaknesses of purely geometric methods for generating indoor routing graphs, a semantic-geometric method is proposed to cover the existing gaps in combining the semantic and geometric methods in this study. The proposed method uses the CityGML...

متن کامل

Integrating Semantic Relatedness and Words' Intrinsic Features for Keyword Extraction

Keyword extraction attracts much attention for its significant role in various natural language processing tasks. While some existing methods for keyword extraction have considered using single type of semantic relatedness between words or inherent attributes of words, almost all of them ignore two important issues: 1) how to fuse multiple types of semantic relations between words into a unifor...

متن کامل

Natural Language Processing and Web Mining Application of Social Analytics for Business Information Systems

Social networking tools, blogs and microblogs, user-generated content sites, discussion groups, problem reporting, and other social services have transformed the way people communicate and consume information. Yet managing this information is still a very onerous activity for both the consumer and the provider, the information itself remains passive. Traditional methods of keyword extraction fr...

متن کامل

Micro-blog Personalized Query Expansion Based on Latent Topic Classification

With the increasing maturity of Web2.0 technology and development of micro-blog, the number of micro-blog pages is exponentially rising. Only relying on the traditional micro-blog search engine has not met the requirements of users. Aiming at that the retrieval efficiency of the traditional micro-blog searching method cannot meet the requirements of users, inspired by probabilistic latent seman...

متن کامل

UNIBA: Sentiment Analysis of English Tweets Combining Micro-blogging, Lexicon and Semantic Features

This paper describes the UNIBA team participation in the Sentiment Analysis in Twitter task (Task 10) at SemEval-2015. We propose a supervised approach relying on keyword, lexicon and micro-blogging features as well as representation of tweets in a word space.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Journal of Multimedia

دوره 8  شماره 

صفحات  -

تاریخ انتشار 2013